Multimodal Musician Recognition
نویسندگان
چکیده
This research is an initial effort in showing how a multimodal approach can improve systems for gaining insight into a musician’s practice and technique. Embedding a variety of sensors inside musical instruments and synchronously recording the sensors’ data along with audio, we gather a database of gestural information from multiple performers, then use machine-learning techniques to recognize which musician is performing. Our multimodal approach (using both audio and sensor data) yields promising performer classification results, which we see as a first step in a larger effort to gain insight into musicians’ practice and technique.
منابع مشابه
The SARC EyesWeb Catalog: A Pattern Recognition Toolbox for Musician-Computer Interaction
This paper presents the SARC EyesWeb Catalog (SEC), a group of blocks designed for real-time gesture recognition that have been developed for the open source program EyesWeb. We describe how the recognition of real-time body movements can be used for musician-computer-interaction.
متن کاملRepetitive stress and strain injuries: preventive exercises for the musician.
There are many articles that support stretching, strengthening, good nutrition, hydration, rest, and ergonomics along with many other concepts that may be helpful in preventing repetitive stress injuries. The most conclusive literature proposes early recognition of onset symptoms, and immediate reduction or cessation of the casual activity. This is not well accepted by the musician, because thi...
متن کاملA Robot Musician Interacting with a Human Partner through Initiative Exchange
This paper proposes a novel method to realize an initiative exchange for robot. A humanoid robot plays vibraphone exchanging initiative with a human performer by perceiving multimodal cues in real time. It understands the initiative exchange cues through vision and audio information. In order to achieve the natural initiative exchange between a human and a robot in musical performance, we built...
متن کاملA knowledge-based, data-driven method for action-sound mapping
This paper presents a knowledge-based, data-driven method for using data describing action-sound couplings collected from a group of people to generate multiple complex mappings between the performance movements of a musician and sound synthesis. This is done by using a database of multimodal motion data collected from multiple subjects coupled with sound synthesis parameters. A series of sound...
متن کاملA Multimodal Approach to Dictation of Handwritten Historical Documents
Handwritten Text Recognition is a problem that has gained attention in the last years due to the interest in the transcription of historical documents. Handwritten Text Recognition employs models that are similar to those employed in Automatic Speech Recognition (Hidden Markov Models and n-grams). Dictation of the contents of the document is an alternative to text recognition. In this work, we ...
متن کامل